Mixed state dense coding and its relation to 
entanglement measures 

S. Bose x , M. B. Plenio 1 and V. Vedral 2 
1 Optics Section, The Blackett Laboratory, Imperial College, 

London SW7 2BZ, England 
2 Centre for Quantum Computing, Clarendon Laboratory, 
University of Oxford, Parks Road, Oxford 0X1 3PU, England 

February 1, 2008 



Abstract 

Ideal dense coding protocols allow one to use prior maximal entangle- 
ment to send two bits of classical information by the physical transfer of 
a single encoded qubit. We investigate the case when the prior entangle- 
ment is not maximal and the initial state of the entangled pair of qubits 
being used for the dense coding is a mixed state. We find upper and 
lower bounds on the capability to do dense coding in terms of the various 
measures of entanglement. Our results can also be reinterpreted as giving 
bounds on purification procedures in terms of dense coding capacities. 



1 Introduction 

One of the many suprising applications of shared entanglement jl| is superdense 
coding introduced by Bennett and Wiesner . In the simplest example of this 
protocol, two people (Alice and Bob) share a pair of entangled qubits (spin half 
particles or any other two state systems) in a Bell state. Alice can then perform 
any of the four unitary operations given by the identity I or the Pauli matrices 
ci(cx), C2(c y ) and <73(er z ) on her qubit. Each of these four unitary operations 
map the initial state of the two qubits to a different member of the Bell state 
basis. Clearly, these four orthogonal and therefore fully distinguishable states 
can be used to encode two bits of information. After encoding her qubit, Alice 
sends it off to Bob who can extract these two bits of information by performing a 
joint measurement on this qubit and his original qubit. This apparent doubling 
of the information conveying capacity of Alice's qubit because of prior entan- 
glement with Bob's qubit is referred to as superdense coding (which we shall 
henceforth refer to as just dense coding). Dense coding has been implemented 
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experimentally with polarization entangled photons Some generalizations 
of the scheme to pairs of entangled N- level systems in non-maximally entangled 
states (as opposed to qubits in Bell states) [Q, |[ and to distributed multiparticle 
entanglement ^| have also been studied. 

In the simplest example, the power of the method stems from the accessibility 
of all the four possible Bell states through local operations done by Alice alone on 
her qubit |^|, |j| . Obviously this accessibility is related to the fact that the qubits 
shared by Alice and Bob are in an entangled state. So an important question to 
ask is: In what way does the capacity of conveying information through dense 
coding depend upon the degree of entanglement of the initial shared pair of qubits! 
Barcnco and Ekert 0] and Hausladen et al. || have investigated this question 
when the initial shared entangled state is a pure state. Their analysis shows that 
the amount of information conveyed by the dense coding procedure decreases 
monotonically from its maximum value (two bits per qubit) with the decrease of 
the magnitude of the shared entanglement. It becomes one bit per qubit when 
the entanglement becomes zero. 

Recently, there has been a lot of work in quantifying the entanglement of 
systems in mixed states |8[ ||, [l0| . These measures of entanglement have their 



physical interpretation in the process of entanglement purification |11, [12L [L 



A natural question to ask is: in what way does the capacity to do dense coding 
depend on the degree of entanglement as specified by these measures ? Answering 
such a question, in effect, would mean linking up the apparently disconnected 
concepts of purification and channel capacity. In this paper we take a step 
in this direction by obtaining bounds on the number of bits of information 
conveyed per qubit (let us call this C after capacity) during dense coding in 
terms of the various measures of entanglement. It should be noted that C is a 
classical capacity as it quantifies number of bits of information, but the carriers 
of the information are quantum (qubits). 

We also invert our results so that the value of C gives us information about 
the range within which the initial shared entanglement lies. In other words, 
bounds on entanglement measures (and hence on purification procedures) can 
be expressed in terms of the classical capacity C. This might be helpful, because, 
as we shall show, the classical capacity for dense quantum coding is readily 
calculable for certain special classes of dense coding protocols. 



2 The Holevo function as a measure of C 

Suppose, Alice has a set of mixed quantum states Wj at her disposal to convey 
some classical information (sequences of ones and zeros) to Bob. Each state Wi 
can be regarded as a separate letter. Also suppose that Alice sends the state 
Wi to Bob with an a priori probability p-i. The ensemble that Alice uses to 
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communicate with Bob is therefore given by 

W = ^2piWi. (1) 

In the above case, the average number of bits of information that Alice can 
convey to Bob per transmission of a letter state is bounded from above by the 
Holevo function [[u] 

H = S(W)-^2 Pi S(Wi), (2) 

where S(p) = — Tr plogp denotes the von Neumann entropy of the state p 
(Here, and throughout the rest of the paper, log stands for log 2 ). Using a 
different notation, the Holevo function can be rewritten as 

H = Y J PiS{W i \\W), (3) 

where iS(<r||p) stands for Trjcrlogcr — crlog/?} and is known as the quantum 
relative entropy between the states a and p ]l5|. It was recently shown that 
this bound can be achieved in the limit of an infinite ensemble by appropriate 
Block coding (grouping together and pruning long strings of letter states to 
represent messages) at Alice's end and appropriate measurements at Bob's end 

s 

Now consider the case of dense coding. Alice and Bob initially share an 
entangled pair of qubits in some state Wo, which may be mixed. Alice then 
performs local unitary operations on her qubit to put this shared pair of qubits 
in either of the states Wo, W\, W2 or W3. In general, Alice may use a completely 
arbitrary set of unitary operations to generate these states: 

W, = U; g> I W U; ® I. (4) 

In the above equation, Ui acts on Alice's qubit and I acts on Bob's qubit. By 
sending her encoded qubit to Bob, Alice is essentially communicating with Bob 
using the states Wo, Wi, W2 and W3 as separate letters. The number of bits she 
can communicate to Bob using this procedure is thus bounded by the holevo 
function H given in Eq.(||). Moreover, if some block coding is done on a large 
enough collection of qubits in addition to the dense coding, then the number of 
bits of information communicated is equal to the Holevo function. We will thus 
take 

C = H, (5) 

assuming that any additional necessary block coding will automatically be done 
to supplement the dense coding. Exactly the same assumption has been used 
in Ref.|| to calculate the capacity for dense coding in the case of pure letter 
states. Eqs.([|) and ^ define the most general version of dense coding and we 
shall refer to this as completely general dense coding (CGCD). 
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A simpler example of dense coding is the case when the letter states are 
generated from the initial shared state Wo by 



W = I ® I W I ® I, 

W\ = oi <g> I W oi <g> I, 
Wa = cr 2 ® I Wo 02 <g> I, 

W 3 = cr 3 ® I W CT 3 ® I. 



(6) 
(7) 
(8) 
(9) 



In the above set of equations, the first operator of the combination <7i®I acts on 
Alice's qubit and the second operator acts on Bob's qubit. We shall refer to this 
case (i.e when the letter states are generated by Eqs.(||)-(^)) as simply general 
dense coding (GDC). The generality present in GDC is that Alice is allowed to 
prepare the different letter states with unequal probabilities. In other words, 
one has to use Eqs.(|l|)-(||) to estimate the capacity C. 

In the more special case when Alice not only generates the four letter states 
according to Eqs. (](])- ([)])) but also with equal probability, the ensemble is given 



We shall call this simplest case special dense coding (SDC). Among all the 
possible ways of doing GDC, SDC is the optimal way to communicate when 
W) is a P ure state (as we shall show in the next section) or a Bell diagonal 
state. However, we do not know the optimal way to communicate when Wq is a 
completely general state and CGCD is allowed. For most of our paper, we shall 
obtain bounds on the classical capacity C for SDC only. But we shall point out 
those results which are valid for GDC and CGDC as well. Though the main 
aim of this paper is to establish bounds on the capacity C when the letters are 
mixed states, we shall begin with a calculation of C for pure letter states. 



by 




(10) 



and the capacity becomes 




(11) 



3 C for SDC with pure letter states 



Consider the initial shared pure state Wq to be, 



hfo) = (a|00) + 6|ll)). 



(12) 



Then, according to Eqs.(^)-(||), the other letter states are given by 
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|Vi) = (a|10>+&|01», (13) 
l-02> = — *(a|10> — 6|01», (14) 
|^) = (o|00)-ft|ll)), (15) 

from which we obtain Wi = \tjji) (tpi\. As an are P ure states we have 

S(Wi) = 0. (16) 

Thus from Eqs.(||) and (||) we have 

C = S(W). (17) 

We will consider only the case of SDC. Thus the ensemble used is obtained from 
Eq.@ to be 

lal 2 I6I 2 
W = U-|0Q)(0Q| + U-|01)(01| 

lal 2 Ifel 2 
+ U-|10)(10| + ^-|ll)(ll|. (18) 

Thus from Eq.(|T7]) for the capacity C, we get 

C = -(| a | 2 log^ + |6| 2 log!^) (19) 

= l-(|a| 2 log|a| 2 + |6| 2 log|6| 2 ). (20) 

Now we should recall that a good measure of entaglement for a pure state of 
a system composed of two subsystems A and B is given by the von Neumann 
entropy of the state of either of the subsystems jl3| [l7[ . Let us call this measure 
the von Neumann entropy of entanglement and label it by E v . Thus 

KmWU+B) = SCTtaWMa+b)), (21) 

where Tr^ stands for partial trace over states of system A. Therefore, for all the 
states Wi, 

Ev(Wi) = -(M 2 log |a| 2 + |6| 2 log |6| 2 ). (22) 

Thus, 

C = l + E v (Wi). (23) 

We now prove that for pure states, SDC (using all alphabet states with equal 
a priori probability) is the optimal way to communicate among all possible ways 
of doing GDC (i.e when the letter states are generated by Eqs. ([][)- ([)[)). Consider 
the general case when the states Wi are sent with probabilities pi. Then from 
Eq.(0) we have 

W = Pl +p 2 , (24) 
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where 



Pi = (po+P3)|a| 2 |00)(00| + (p +P3)|6| 2 |ll)<ll| 

+ (po-p8)a*6|H>(00| + (po-i»8)o6*|00>(ll|, (25) 

and 

P2 = (pi+P2)|a| 2 |10)<10| + ( Pl +p 2 )|6| 2 |01)<01| 

+ (p 1 -p 2 )a*6|01)(10| + (p 1 -p 2 )a&*|10)(01|. (26) 

Therefore, p\ and pi form separate blocks inside the matrix W and 

S(W)=S( Pl ) + S(p 2 ). (27) 



Eg. (|17p indicates that we need to choose the probabilities pi in such a way 
that S(W) is maximized. Density matrices with same diagonal elements have 
the highest von Neumann entropy when the nondiagonal elements are zero. 
Applying this fact to Eqs.(p5|) and @) we get 

Po=P3, (28) 
Pi=P2- (29) 

Using Eqs.(|2|) and (|2^) in the expressions for p x and p 2 and calculating the 
entropy S(W) gives 



S ( W ) = - E 2 ^d fl | 2 lo S 2pM 2 + \b\ 2 log 2 Pl \b\ 2 ) 

i=0 
1 

= -^2 Pl log2p l -(| a | 2 log|a| 2 + |6| 2 log|6| 2 ), (30) 

i=0 

where the normalizations of the state amplitudes and the probabilities pt have 
been used. From analysis of von Neumann entropies it is well known that the 
expression — J3j=o ^Pi ^°S^Pi nas a maximum value when both po and Pi are 
equal. Thus S(W), and hence the classical capacity C is maximized when 

PO =Pl =P2 =P3 = J- (31) 

Thus, among all the possible ways of performing GDC, SDC is the optimal way 
to communicate when pure states are being used as letters. From the above 
result and Eq. (p3|) we can conclude that in the case of GDC with pure letter 
states we have 

C<l + E v {Wi). (32) 
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This result (Eq.(|32|)) had also been obtained in Ref.Q from logical arguments. 
Following a procedure analogous to the above proof it can be shown that for 
Bell diagonal letter states SDC is again the optimal way to communicate among 
all the possible ways of doing GDC. 

4 Nature of the ensemble used in SDC 

In order to derive a lower bound on the classical capacity C for SDC we need 
to prove a crucial lemma concerning the nature of the ensemble used in SDC: 

For states Wi defined in accordance to Eqs. ®-®), the ensemble W for SDC 
as defined in Eg. is a disentangled state irrespective of the nature of Wo. 

To prove this, we have to start by assuming Wo to be a most general state 
of two qubits. This is given by (l8) 

Wo = hl®I + ^r m a m ®I 

m 

£mnO" m <g) er„], (33) 

where the indeces m and n take on values from 1 to 3. Substituting Wq from 



Eq.Q into Eqs.(|6|)-(|) we get 

Wi = -[I ® I + noi ® I ~ ^ r m a m ® I + I ® ^ s m a m 

+ ^ f 1 W 0"1 ® OVt ~ *mnCTm ® fj, (34) 

n m^l,n 

- [/ ® / + r 2 o- 2 ® / - y^ r ro g m ® i + / ® y^ 

m^2 m 

} J t2nO-2 ®<J n - ^ t mn O- m ® CT„], (35) 
n m^2,n 

- [I ® I + r 3 cr 3 ® I - ^ r rnO- m ® I + I ® ^ S 



w 2 = 



w 3 



m^3 m 

^i3riO" 3 ® <7„ - ^ J mn Ura®ffn]- (36) 

71 m^3,n 



Using expressions for Wj from Eqs.(|33|)-(|36|) in Eq.(|10[) we get 

W = I® \[I + ^a m a m \. (37) 
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This is very clearly a disentangled state. A plausible physical argument to 
support this result can be as follows. We know that an equal mixture of the 
four Bell states is a disentangled state. So it seems highly likely that an equal 
mixture of less entangled states will be disentangled as well. This result is, of 
course, valid only for SDC, where the equal probabilities of the letter states 
result in the careful cancellation of all the entanglement carrying terms. It 
allows us to easily derive a lower bound on C for SDC in terms of one of the 
measures of entanglement. 

5 The relative entropy of entanglement as a lower 
bound on C 

To investigate quantitatively the relationship between entanglement and C for 
arbitrary mixed letter states, it is necessary to use some measure of entangle- 
ment for mixed states. One such measure of entanglement, the relative entropy 
of entanglement, has been introduced in Ref.j8|. It has been shown to have a 
statistical interpretation Q as well as a physical interpretation JlOj , |l9| in form- 
ing bounds on the process of entanglement purification [^, O |l^ |lj . We will 
use the symbol E R to represent this measure of entanglement. For an arbitrary 
mixed state a it is given by 

E R (a)= mm S(a\\p), (38) 

pGT> 

where T> is the set of disentangled states. A property which has to be satisfied 
by any legitimate entanglement measure is its invariance under local unitary 
operations p0[ . E R (a) is thus invariant under local unitary operations of the 
state a. 

As the state W of the ensemble used for SDC has been shown to be a 
disentangled state (i.e. W G T>) in the previous section, we have 

S{Wi\\W) > mmS{Wi\\p) = E R (Wi). (39) 

pGT> 

Using the above inequality in Eq.([ll]) for C of SDC we get 

3 , 3 



C = \Y,Sm\W)>\Y,E R (W l ). (40) 

i=0 i=0 

As each of the states Wi are derived from Wo via local unitary operations only, 
we have 

E R (Wi) = E R (W ) (41) 
for all values of i. Combining Eqs.(f4l|) and ( f40| ) we get 

C > E R (W ). (42) 
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This means that the classical capacity C for SDC is bounded from below by the 
relative entropy of entanglement of the initial shared mixed state. This result 
is however, not generalizable to GDC as it relies crucially on W for SDC being 
a disentangled state. 

6 Average distinguishability as an upper bound 
on C 

In this section, we are going to divert a little bit from the main theme linking C 
to entanglement measures and point out another interesting bound on C stem- 
ming from the mutual distinguishability of the letter states Wj. It is known 
that the relative entropy S'(WjllWj) is a kind of statistical measure of the dis- 
tinguishability between the quantum states Wi and Wj Thus for GDC one 
can define an average mutual distinguishability function as 

5(p i ,W i ) = Y i Y i p i p j S(Wi\\W j ). (43) 

« 3 

We show below that this average mutual distinguishability function forms an 
upper bound on C. Putting W from Eq.(Jj]) into Eq.(||) we obtain 

3 3 

c = J> swwY^Pi Wj) 

i=o j=o 

3 3 3 

= E^scEw w*nl> w j)> ( 44 ) 

»=0 j=0 3=0 

where the factor X^=o Pi nas been inserted before Wi in the second step as 
its value equals unity. We now use the joint convexity property of the relative 
entropy 

s(E A ^nE A ^) < E^Mi*) ( 45 ) 

to expand the right hand side of Eq. (Q) and obtain 

3 

C < E PiPi SiWiWWj) = S( Pl , Wi). (46) 

i,i=0 

Note that the above result is completely general because neither does it require 
the special class of local unitary operations given by Eqs.(p])-(pp, nor does it 
require the probabilities to be uniform. Thus the classical capacity for CGDC 
is bounded from above by the average mutual distinguishability function. Note 
that we are pointing this out just as an interesting bound and it is not linked to 



the main theme of the paper (relating entanglement measures stemming from 
purification procedures to dense coding capacities). 



7 An upper bound on C in terms of the entan- 
glement of formation 

In this section we find an upper bound on C in terms of yet another mea- 
sure of entanglement, namely, the entanglement of formation Ef- Consider a 
decomposition of an arbitrary mixed state a in terms of pure states cr, : 

a = J2 X i a i- ( 47 ) 

i 

Then the entanglement of formation of this state is defined as j?], ||(J 

E F {a) = minY^\E v ((Ti), (48) 

i 

where the minimum is taken over all decompositions of a of the type given by 



Eq.(47) 



The initial shared state Wo used in dense coding, will, in general, have several 
decompositions in terms of pure states. Let the particular decomposition from 
which its entanglement of formation Ep(Wo) is calculated (referred to as the 
entanglement minimizing decomposition pp| ) be 

W =J2<lmW 0m , (49) 

m 

where Wo m are pure states and q m are probabilities. As Eq.(^9|) gives the 
entanglement minimizing decomposition, we have 

E F (W ) = 1 m E v (W 0m ), (50) 

i n 

while the normalization of the probabilities imply 

X>™ = 1 - ( 51 ) 

m 

As each of the signal states W% are derived from Wq by local unitary opera- 
tions, they can be decomposed as 

Wi = J29mW im , (52) 

m 

where each pure state Wi m is connected to the pure state Wo m by exactly 
the same local unitary operation as that which connects Wi to Wq. As any 
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legitimate measure of entanglement has to remain invariant under local unitary 
operations, we have 

E v (W im ) = E v (W 0m ), (53) 

and 

E F {Wi) =E F {W a ). (54) 
/From Eqs.(|5^) and (^34|) we have 

2J q m E v ( W im ) = 22 q m E v ( W 0m ) 

m m 

= £7 F (Wb) = -MWi). (55) 
Now, in the case of SDC, the capacity (from Eq.jTl])) is 

= iE^E^-ii^EE^™) ( 56 ) 

i m j m 

i m m 

where we have put 

W m = \Y,W jm . (58) 

j 

Using joint convexity (Eq.Mq)) to expand the right hand side of Eq.(^7|), we get 



C < i^^g m 5(W jm ||^ m ) 

i xn 

= E^^iE 5 ^™!!^™))- ( 59 ) 

m i 

Now, for each value of m, the expression | SX^imllWm) can be regarded as 
the classical capacity C for SDC with the states Wi m as the four letter states. 
As each of the states Wi m , are pure, Eq.(|23|) implies 



\J2 S( - W ^ W ^ = 1 + E ^ W "^- ( 6 °) 

Putting Eq.@ into Eq.(||) we get 

C<Y,1m^+E v {W im )). (61) 
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Using Eqs.([5l|) and ( p5| ) to simplify the right hand side of Eq. (J6l|) we get 



C<1 + E F (Wi). (62) 

The above bound is valid for GDC as well. To see this one just has to repeat 
the above proof starting with 

C^piSm^PiWj) (63) 

i 3 

and replace Eq.(|58|) with 

W m =^PjW jm . (64) 

i 



Eq.(59) then gets replaced by 

C <Y,<l™{Y,Pi S (W im \\W m )). (65) 

m i 

Now note the fact that 5(Wira||Wm) is the expression for the classical 

capacity C for CGDC with Wi m being the letter states. When the states Wi m 
are generated from W^ according to Eqs.(||)-(||) (i.e. when GDC protocol is 
being followed), then the purity of the states Wi m guarantees that C is less 
than 1 + E v (Wi m ) (as shown in section ||). Using this fact in Eq.(|6^) we again 
end up with Eq.(j6^). Thus even in the case of GDC, the capacity C is bounded 
by \ + E F {W t ). 



8 An upper bound on C in terms of the relative 
entropy of entanglement 

Having analytically proven that l+Ep(Wi) is an upper bound on C for GDC, we 
now proceed to check whether the even smaller (as proved in Ref . fl(i|| ) quantity 
1 + En(Wi) is also an upper bound. However, we do not attempt to prove 
this analytically for a completely general initial shared state Wo- Instead we 
calculate the capacity C (for SDC only) for those specific classes of the initial 
shared state Wo whose relative entropy of entanglement Er(Wo) is already 
known. We then plot this capacity C as a function of the relative entropy of 
entanglement En for each of these classes of states and check whether the curve 
C(Er) lies below the plot of 1 + Er. 

At first we have a look at mixed states of the type jnj 

A A = A|$+)($+| + (l-A)|01)(01|, (66) 
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where |$ + ) is one of the four Bell states which are defined by 

= -L(|00>±|11» (67) 
= i=(|01)±|10)) (68) 



We shall refer to the states described by Eq.(|66j) as lambda states of the type 
A. For them, the relative entropy of entanglement is 

E R (A A ) = (A — 2) log(l - A) + (i _ A) log(l - A), (69) 
while the C for SDC is 



C = (l-A)log(l-A) + i(A-2)log(l-~) (70) 

+ lAlogA+(l + ^). (71) 

The plot of the classical capacity C for these states as a function of Er of the 
state has been shown in figure |l| and it is indeed found that 

C<1 + E R . (72) 

The equality holds true only at the two ends of the graph, namely at maximal 
entanglement Er = 1 (when o\ approaches a Bell state) and zero entanglement 
Er = (when o\ approaches |01)(01|). 
Next we look at states of the type [ [L0| 

A B = A|$+)($+| + (l-A)|00)(00| (73) 

which we call lambda states of the type B. For these ]l0[ ] 

E R {A B ) = s + log s + + s_ log s_ 

- (l-^)log(l-^-^log^ , (74) 

where 



1 ± y/1 - 2A(1 - A) 
«± = g ( > 

are the eigenvalues of Ar. The C for SDC in this case is given by 
C = s + logs-|- + s- logs_ 

- a-5) l0 4 (1 -^-5 l0g r (76) 
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As is clear from a simple comparison of Eqs.(|74|) and (|76|), for lambda states of 
the type B, 

C = 1 + E R . (77) 

Thus, a curious feature of lambda states of type B is that the capacity for SDC 
is actually always equal to 1 + Er. We will discuss a bit more about this curious 
aspect later in the section. 

Now we consider another special class of states called the Werner states 
[Q |l) parameterized by a number F called the fidelity Jt| and given by 

W F = + 

+ |*+)($+| + |$-)($-|). (78) 
The relative entropy of entanglement of these states is given by [[To) 

E R {W F ) = FlogF+(l-F)log(l-F) + l, (79) 
and the C for SDC is calculated to be 

C = 2 + FlogF + (1 - F) log ^ ~ F > . (80) 

o 

The plot C versus Er has been drawn in Fig.^| and it is found that even in this 
case, 

C<l + E R . (81) 
Now consider the case of a general Bell diagonal state 

B D = Ai|*-)(*-| + A 2 |# + )(# + | 

+ A 3 |$ + )($+| + A 4 |$-}($-|. (82) 

It has been proved in Ref.|| that when all Ai S [0, i] then 

Er(B d ) = 0, (83) 

while when any of the (say Ai) > ^, then 

Er(B d ) = Ai logAi + (1 - Ai)log(l - Ai) + 1. (84) 

The C for SDC always turns out to be 

C = 2 + A 4 log Ai. (85) 

i 

First consider the case when all A^ S [0, |]. From Eqs.(^) and ( |85| ) we find that 
1 + Er{B d )-C = -^AilogAi-1 

i 

= -^AiOogA^ + l) 

i 

> 0, (86) 
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where J2i A* = 1 has been used to proceed from the first to the second step and 
logAi < — 1 (because Xi S [0, ^]) has been used to proceed from the second to 
the third step. 

Now consider the complementary case (Ai > ^). From Eqs.(p^) and ( j85| ) we 
find that 



1 + E R (B D )-C = (l-A 1 )log(l-A 1 )-^A i logA i 

= (^A^log^A.O-^AilogA, 

i#i i/i 

= J^AiOogC^AjO-IogAi) 

> 0, (87) 

where in order to proceed from the third to the fourth step we have used the 
simple fact that log(A2 + A3 + A4) is greater than either of the terms log A2,log A3 
or logA4. Thus for all Bell diagonal states we have 

C<1 + E R . (88) 

Now we will point out a curious fact about the situation when any two of the 
eigenvalues of a Bell diagonal state (say A3 and A4) are zero. When Ai = A 2 = ^ 
(which means Er — 0), we have C= 1 (using Eq.(|85|)). In all the other cases 
(for which we use Eqs.([S4|) and (|85|)) we have 



1+E R (B D )-C = (l-Ai)log(l-Ai)-A 2 logA 2 

= 0, (89) 

where we have used the simple fact that Ai + A 2 = 1. Thus for all Bell diagonal 
states with only two nonzero eigenvalues we have 

C = 1 + E R . (90) 

On the basis of the results obtained in this section let us conjecture: 
Conjecture : The C for SDC with completely general (possibly mixed) 

states is bounded from above by 1 + Er. 

A great deal of empirical evidence has been presented in this section in 

support of the conjecture. In the next section we proceed to give a heuristic 

justification in support of our conjecture. In fact, we will try to justify an even 

stronger upper bound on C. 
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9 SDC and purification 



To justify the conjecture of the previous section, we will have to examine the 
following interesting question: How does the capacity C change if Alice and 
Bob first locally purify their ensemble and distill Bell states (following the op- 
timal purification procedure) and follow this up by SDC? Various purification 
procedures have been described in Refs.fi], [fl], [l^, Here we assume that 
Alice and Bob follow the optimal purification process: One which helps them 
to distill the maximum fraction of Bell states from the initial ensemble. They 
will, after optimal purification, have a fraction Ed (where Ed is called the en- 
tanglement of distillation) shared pairs in Bell states and a fraction (1 — Ed) 
pairs in a disentangled state. They now complete their 'purification' process by 
converting the final subensemble of disentangled pairs to pure states by projec- 
tive measurements. We refer to such a protocol as complete purification. After 
a complete purification, Alice can use the fraction Ed of Bell pairs to send Bob 
information at the rate of 2 bits/pair and the fraction (1 — Ed) of pure disen- 
tangled pairs to send information at the rate of 1 bit/pair. Thus if Alice and 
Bob initially shared n pairs, the classical capacity after a complete and optimal 
purification procedure is 

C = -{2nE D + n{\- E D )} 
n 

= l + E D . (91) 

Note that the above result is only asymptotically true (n — > oo). Naively, one 
might expect this C to be lower than the C before purification. This is because, 
as mentioned in Ref . , entanglement concentration is a more destructive pro- 
cess than quantum data compression. Some amount of shared entanglement is 
destroyed in the process of purification. So it might be expected that the final 
ensemble after a purification will be able to convey less classical information 
than the original ensemble. On the contrary, as we will justify, the capacity 
for SDC with the purified ensemble is greater than that with the unpurificd 
ensemble when an optimal and complete purification protocol is used. The gain 
comes from the fact that now there are two separate sub-ensembles instead of 
a single ensemble. In this section, we will try to justify, albeit heuristically, the 
following statement (as a more fundamental conjecture than the one presented 
in the previous section): 

Conjecture: The capacity for SDC is more when it is preceeded by a complete 
and optimal purification procedure. 

We will first consider the special case of Bell diagonal mixed states, for 
which the proof of the increase in the SDC capacity on a complete and optimal 
purification (our conjecture) can be rigorously proved. For Bell diagonal states 
with entropy S(p) < 1, there is a purification protocol called hashing (with the 
distillable fraction being Edh = 1 — S(p)) pj. /,From Eq.(|85|) we see that for 
these Bell diagonal states, the C for SDC is equal to 1 + Edh- As hashing may 
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not necessarily be the optimal protocol, we have Edh < Ed- This immediately 
implies C < 1 + Ed- For the complementary case of Bell diagonal states with 
S(p) > 1, we have (from Eq.(f|) for SDC), C < 1. This is obviously less 
than 1 + Ed for any finite value of the entanglement of distillation (i.e for all 
inseperable states p^|). Thus for all Bell diagonal states the C for SDC can 
be improved by a prior optimal and complete purification of the ensemble of 
shared states. 

For the more general case of arbitrary mixed entangled states, the proof 
will, essentially, be heuristic. Our approach will be to split the change in the 
classical information capacity due to complete and optimal purification into two 
parts. The first part is positive (an increase in capacity) and due to the ad- 
dition of classical side channels during the purification procedure. These side 
channels are used by Alice (A) to communicate the results of her measurements 
to Bob (B), or vice versa. As this communication during purification is already 
conveying information from A to B (or vice versa), the channel capacity of the 
classical side channels should be directly added to the classical capacity (of 
course, for this, we have to implicitly assume the additivity of classical capaci- 
ties). The information conveyed from A to B (or vice versa) during purification 
is actually used by the two parties to precisely identify their entangled and 
disentangled subensembles. There is also a negative contribution (decrease in 
capacity) during the purification as a fraction 1 — Ed of shared pairs loose all 
their entanglement. Of course a part of this entanglement is pumped into the 
entangled subensemble, but the remaining is lost. Our job will be to argue that 
the positive contribution in channel capacity due to addition of classical side 
channels outweighs the negative contribution due to loss of entanglement in an 
optimal and complete purification procedure. 

Consider the process of purification as a process of information gain by A 
and B. Before the purification, both A and B have an equal amount of knowledge 
about their shared system (they both know the full density matrix). Finally, 
each of their shared states are pure (because of the completeness of the pu- 
rification procedure) and both of them have equal knowledge (i.e each know 
which of the pairs form the disentangled subensemble and which of the pairs 
form the maximally entangled subensemble). Thus they have both gained an 
equal amount of information I/pair about their shared pairs during the process 
of purification. Now, acquiring a part of this information may not cause any 
destruction of shared entanglement (lets call it Ii/pair), while the remaining 
part (lets call it 12/pair) does cause a lowering of shared entanglement. We 
now contend, that this part, I^/pair has to be equal to the number of classical 
communication channels used in the optimal purification procedure. The logic 
follows from the fact that the optimal strategy would be for A and B to collabo- 
rate in such a way that both of them gain the entire information I2 /pair with the 
least destruction of entanglement. They must then each acquire only a fraction 
of the information complementary to the fraction acquired by the other. In this 
way, the total information acquired by A and B from the shared pairs through 
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direct entanglement degrading measurements is 12/pair. They can then use a 
minimum of I2 classical side channels per pair to communicate their fraction of 
the acquired information to the other party. On the other hand, if each wanted 
to acquire a fraction of the total information by direct measurements which 
was not entirely complementary to the part acquired by the other, they would 
destroy more entanglement than is really necessary. Thus we would expect that 
the classical side channels contribute to boosting the capacity up by at least an 
amount I 2 /pair in the optimal purification procedure. 

Now consider how much the capacity decreases due to the degradation of 
shared entanglement during the purification procedure. The classical capacity 
of a fraction 1 — Ed of the pairs drops from at most 2 bits /pair to 1 bit /pair (due 
to the completeness of the purification procedure, the final capacity cannot be 
lower than 1 bit/pair). Thus the drop in classical capacity of each of the finally 
disentangled pairs is not more than 1 bit. This implies that the net decrease in 
classical capacity of the entire ensemble due to loss of entanglement hasn't been 
more than 1 — Ed bits/pair. 

When the information I2 is greater than 1 bit/pair, the proof of our statement 
is straightforward. The increase in capacity due to classical side channels (i.e 
12/pair) is more than 1 bit/pair, while the decrease in capacity due to loss 
of entanglement is less than 1 — Ed bits/pair. As 1 — Ed is a fraction (i.c 
1 — Ed < 1), the increase in capacity on purification overrides the inevitable 
decrease in capacity due to degradation of shared entanglement. Clearly, thus 
the capacity to do SDC increases on prior purification when ±2 > 1 bit. 

Now, we have to show that no matter what the value of I2 is, it is pos- 
sible to find a purification protocol for which I2 is greater than 1 — Ed bits. 
In other words, if initially there are n shared pairs (with n being large), then 
the number of shared pairs which loose all their entanglement on gaining an 
information of 12/pair can be made less than 7T2 by an appropriate choice of 
the purification protocol. From such a statement it directly follows that the 
increase in capacity on purification overrides the inevitable decrease in capacity 
due to degradation of shared entanglement. A simple way in which one may try 
to justify the above proposition is from the fact that one pair generally allows 
the extraction of upto two bits of information (one bit from each of A's and B's 
qubit). So, it should be possible, in general, to extract n±2 amount of informa- 
tion from measurements on less than ni2 shared pairs (whose entanglement is 
degraded). This intutive understanding leaves open the question as to whether 
the information ±2 /pair relevant to purification can be acquired in this way. For 
that we resort to explicit exemplification. Consider the class of mixed states 
p (having nonzero entropy S(p)) and a diagonal decomposition ^2 X^i) (ipil 
in terms of the pure states \ipi)- Also suppose that this diagonal decomposi- 
tion does not coincide with the entanglement minimizing decomposition p0| of 
p. For such states, acquiring the information about the entropy (S*(p)/pair) is 
essential during purification, as the final ensemble is pure. We now contend 
that acquiring this S(p) /pair of information necessarily destroys some amount 
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of shared entanglement. We will justify this by the method of contradiction. 
Suppose, it was really possible to acquire the information about the entropy 
S(p) without destroying any entanglement. On acquiring an information of the 
amount equal to 5(p)/pair, A and B will be able to divide their initial ensemble 
to four seperate pure subensembles, each being comprised of one of the states 
\ipi). The weight of each of these subensembles will be equal to the eigenvalues 
Aj. The average entanglement shared by A and B after extracting the infor- 
mation 5(p)/pair is thus KE^dipi)). However, for the class of states p that 
we are considering, ^2 XiE v (\ipi}) is necessarily greater than the initial shared 
entanglement (as quantified by the entanglement of formation Ep). Thus by 
purely local actions and classical communications, A and B have been able to 
increase the shared entanglement for these classes of mixed states. This contra- 
dicts the definition of entanglement as a quantity that cannot be increased by 
local actions and classical communications. Thus, mixed states with the diagonal 
decomposition not coincident with the entanglement minimizing decomposition 
cannot be locally purified without necessarily destroying some shared entangle- 
ment. Thus the information about the entropy of these states is of the type I2 
(i.e acquiring the information necessarily causes degradation of entanglement). 
A corollary which immediately follows (though not directly relevant to the main 
point of the paper), is: the optimal purification of mixed states with the diagonal 
decomposition not coincident with the entanglement minimizing decomposition 
necessarily has a nonzero I2 and thus necessarily requires the use of classical 
side channels. Now consider such states when their entropy, and thereby ±2, 
exceeds 1 bit. Thus the total information 71I2 needed to be acquired is greater 
than n bits. However, purifiability (nonzero Ed) implies that the total entan- 
glement loss can be entirely concentrated to the entanglement loss of n(l — Ed) 
pairs which is less than 71I2 pairs. Therefore, it is possible to gain the relevant 
nl.2 bits of information by destroying the entanglement of less than ni2 pairs. 
Now we really need an unproven extension of the above statement: even when 
I2 < 1 bit and irrespective of the nature of the decomposition of p, it is possible 
to gain the relevant nl2 amount of information by destroying the entanglement 
of less than nl2 pairs. Though this may seem a large extrapolation, it seems 
highly plausible because one shared pair allows the extraction of upto two bits of 
information. Thus we would expect that, no matter what the value of I2 is, the 
increase in capacity on purification overrides the inevitable decrease in capacity 
due to degradation of shared entanglement. 

Thus the capacity to do SDC with the two pure subenembles (maximally 
entangled and fully disentangled) after a complete and optimal purification, is 
expected to be greater than the initial mixed (and uniformly entangled) ensem- 
ble. We thus conjecture, with the help of Eq.(|9l|), the following bound on the 
capacity of doing SDC with any mixed entangled state 

C<l + E D . (92) 
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This is an even stronger upper bound on the classical capacity than the 
bound conjectured in the previous section as the relative entropy of entangle- 
ment Er is necessarily greater than Ep Jl^]- From Eq.(p^), it directly follows 
that C < 1 + Er (our previous conjecture). Note that in this section, this 
conjecture has been rigourously proved for all Bell diagonal states, heuristically 
proved for all states with entropy S(p) > 1 bit (i.e I 2 > 1 bit) and shown to be 
highly plausible for other types of states. 

10 Conclusion 

In this paper we have obtained bounds on the capacity of doing dense coding 
in terms of the different measures of entanglement stemming from purification 
procedures. The rigorously proved part of the results of this paper is that for 
SDC one always has 

Er < C < 1 + E F . (93) 

On the other hand if we allow for a conjecture (well supported by examples and 
heuristically justified in Section.^), we have the stronger result 

E R < C < 1 + E R . (94) 

What is the significance of bounds on the C for SDC when it can be readily 
calculated? The importance can be realized when we invert the above equation 
and write 

C - 1 < E R < C. (95) 

Thus by calculating (or measuring) the channel capacity for SDC, we can draw 
an inference about the range in which the entanglement of the shared states 
(as quantified by Er) lies. Also one of the bounds, namely, C < 1 + Ef 
continues to hold even when we are allowed to vary the a priori probability of 
the various signal states (GDC). Thus, our inequality allows one to impose a 
readily calculable (from the expression for Ep in Ref . |pOf ) limit on the capacity 
for GDC, without having to optimize GDC over all possible a priori signal 
probabilities. 

The physical interpretation of the upper bounds becomes clear from the 
considerations given in Section.^. During optimal and complete entanglement 
purification procedures one generally enhances the classical capacity much more 
due to added classical side channels than the inevitable decrease in capacity 
brought by the loss of entanglement. The information transferred through these 
classical side channels during purification helps to identify the entangled and 
disentangled subensembles. It is through this identification that they directly 
play the role of boosting the capacity. As after an optimal and complete en- 
tanglement purification procedure, the capacity is 1 + Eo, this post-purification 
capacity is an upper bound on the pre-purification capacity. As both the entan- 
glement measures Ep and Er are upper bounds on Ep,, all our upper bounds 
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follow immediately. It is interesting to note that the upper and the lower bounds 
are trivial in complementary regimes. When C < 1, the upper bounds are triv- 
ial, but the lower bound (Er < C) is non trivial. On the other hand when 
C > 1, the upper bounds are non trivial while the lower bound is trivial. 

We believe that this paper imparts physical significance to the various mea- 
sures of entanglement from the viewpoint of forming bounds on certain kinds 
of dense coding procedures. This direction of physical interpretation of the en- 
tanglement measures is very different from the standard interpretations which 
stem from entanglement dilution and distillation processes. In a sense, this 
paper links up the apparently disconnected notions of entanglement purification 
and dense coding. The considerations of the previous section implies the follow- 
ing lower bound on the distillable entanglement: Ed > C — 1. This is important 
because as yet there is no explicit formula for Ed for a general mixed entangled 
state. Thus the readily calculable quantity C — 1 (in the case of SDC) may offer 
a convinient lower bound on the distillable entanglement of a given mixed state. 

The interesting question to examine is what apart from entanglement is in- 
volved in determining the dense coding capacity. This may be some measure 
of distinguishability between the letter states (hinted by the fact that the av- 
erage mutual distinguishability function forms an upper bound on the classical 
capacity C). An aim of further research should be to work towards a complete 
formula for the capacity of dense coding. Moreover, working with a similar moti- 
vation as this paper, one can also try to relate entanglement measures stemming 
from purification procedures to the other uses of shared entanglement such as 
teleportation p3| and secret key distribution p4j . 

11 Acknowledgements 

We would like to thank Vladimir Buzek, Daniel Jonathan and Peter Knight for 
valuable discussions. This work is supported by the Inlaks Foundation, Elsag- 
Bailey, Hewlett-Packard, the European TMR networks ERB 4061PL951412 
and ERBFMRXCT96066, the UK Engineering and Physical Sciences Research 
Council, the European Science Foundation and the Leverhumc trust. 

References 

[1] For a review see: Plenio, M. B., and Vedral, V., 1998, Cont. Phys., 39, 
431. 

[2] Bennett, C. H., and Wiesner, S., 1992, Phys. Rev. Lett, 69, 2881. 

[3] Mattle, K., Weinfurter, H., Kwiat, P. G., and Zeilinger, A., 1996, Phys. 
Rev. Lett., 76, 4656. 

[4] Barenco A., and Ekert, A. K., 1995, J. Mod. Opt, 42, 1253. 



21 



[5] Hausladen, P., Jozsa, R., Schumacher, B., Westmoreland, M., and Woot- 
ters, W. K., 1996, Phys. Rev. A, 54, 1869. 

[6] Bose, S., Vedral, V., and Knight, P. L., 1998, Phys. Rev. A, 57, 822. 

[7] Bennett, C. H., DiVincenzo, D. P., Smolin, J. A., and Wootters, W. K., 

1996, Phys. Rev. A, 54, 3824. 

[8] Vedral, V., Plenio, M. B., Rippin, M. A., and Knight, P. L., 1997, Phys. 
Rev. Lett., 78, 574. 

[9] Vedral, V., Plenio, M. B., Jacobs, K., and Knight, P. L., 1997, Phys. Rev. 
A 56, 4452. 

[10] Vedral, V., Plenio, M. B., 1998, Phys. Rev., A 57, 1619. 

[11] Bennett, C. H., Brassard, G., Popescu, S., Schumacher, B., Smolin, J. A., 
and Wootters, W. K., 1996, Phys. Rev. Lett., 76, 722. 

[12] Deutsch, D., Ekert, A., Jozsa, R., Macchiavello, C, Popescu, S., and San- 
pera, A., 1996, Phys. Rev. Lett, 77, 2818. 

[13] Bennett, C. H., Bernstein, H. J., Popescu, S., and Schumacher, B., 1996, 
Phys. Rev. A, 53, 2046. 

[14] Kholevo, A. S., 1973, Probl. Peredachi Inf 9, 177 [Kholevo, A. S., 1973, 
Problems of Information Transmission, 9, 177 ]; The Holevo bound was 
first conjectured by: Gordon, J. P., 1964, Quantum Electronics and Co- 
herent light, Proceedings of the International School of Physics "Enrico 
Fermi," Course XXXI, edited by P. A. Miles (Academic, New York), 156- 
181; Levitin, L. B., 1964, Information, Complexity and Control in Quantum 
Physics, edited by P. A. Miles, (Academic, New York), 111-115. 

[15] Ohya, M., 1989, Rep. Math. Phys., 27, 19; Hiai, F., and Petz, D., 1991, 
Comm. Math. Phys., 143, 99; Donald, M. J., 1986, Comm. Math. Phys., 
105, 13; Donald, M. J., 1987, Math. Proc. Camb. Phil. Soc, 101, 363. 

[16] Holevo, A. S., 1998, IEEE Trans. Info. Th., 44, 269; Schumacher, B., 
Westmoreland, M. D., 1997, Phys. Rev. A, 56, 131. 

[17] Popescu, S., and Rohrlich, D., 1997, Phys. Rev. A, 56, R3319. 

[18] Horodecki, R., Horodecki, P., and Horodecki, M., 1995, Phys. Lett. A, 200, 
340. 

[19] Rains, E. M., Entanglement purification via separable superoperators, 

1997, LANL e-print |quant-ph/9707002| . 

[20] Wootters, W. K., 1998, Phys. Rev. Lett, 80, 2245. 



22 



[21] Werner, R. F., 1989, Phys. Rev. A, 40, 4277. 

[22] Horodecki, M., Horodecki, P., and Horodecki, R., 1997, Phys. Rev. Lett., 
78, 574. 

[23] Bennett, C. H., Brassard, G., Crepeau, C, Jozsa, R., Peres, A., and Woot- 
ters, W. K., 1993, Phys. Rev. Lett, 70, 1895. 

[24] Ekcrt, A. K., 1991, Phys. Rev. Lett, 67, 661. 



23 



Figure 1: Two different functions F(Er) of the relative entropy of entanglement 
Er are plotted. The continuous line shows the capacity C for SDC as a function 
of the relative entropy of entanglement Er if the initial shared state is a lambda 
state of the type A. The dashed line shows the function 1 + Er. The figure 
illustrates that the capacity C for SDC is bounded from above by 1 + Er 
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Figure 2: The continuous line shows how the capacity C for SDC varies with 
the relative entropy of entanglement Er of the initial shared Werner state. The 
dashed line shows that it is bounded from above by 1 + Er 
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